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The existence of mammalian female germline stem cells (FGSCs) indicates that mammalian ovaries possess 
germline stem cells analogous to testis, and continue to produce gametes postnatally, which provides new 
insights into female fertility. In this study, we compared the morphological and molecular characteristics 
between FGSCs and spermatogonial stem cells (SSCs) by analysis of morphology, immunofluorescence, 
alkaline phosphatase activity assay, reverse transcription polymerase chain reaction (RT-PCR) and 
microarray hybridization. The results demonstrated that the morphology and growth patterns of FGSCs are 
similar to those of SSCs. Microarray analysis of global gene expression profiles of FGSCs and SSCs showed 
similar signatures in the transcriptome level. A list of 853 co-highly expressed genes (CEG) in female and 
male germline stem cells may be responsible for the morphological and molecular similarity. We 
constructed a continuous network of the CEG based on I2D protein-protein interaction database by breadth 
first search. From the network, we could observe the interactions of the CEG may be responsible for 
maintaining the properties of germline stem cells. This study was the first attempt to compare 
morphological and molecular characteristics between FGSCs and SSCs. These findings would provide some 
clues for further research on mammalian FGSCs. 



Stem cells are cells that can renewing themselves and can differentiate into mature specialized body cells. 
Using these characteristics, as well as morphology and surface markers, many types of adult stem cells have 
been identified 1,2 . Germline stem cells share two characteristics with adult stem cells, in that they can self- 
renew and provide daughter cells 3 ; however, they are also responsible for genetic information transmission from 
parents to subsequent generations 4 . In mammals, male germline stem cells (spermatogonial stem cells, SSCs) were 
identified and confirmed very early, because the male can produce sperm throughout its entire lifetime 5 . In the 
conventional theory, however, female germline cells of most mammalian species enter meiosis and are arrested at 
the diplotene stage of meiotic prophase I before birth 6 8 . This theory has been challenged by findings suggesting 
that postnatal oogenesis occurs in mouse ovaries 9 . 

Recently, we showed that female germline stem cells (FGSCs) from neonatal and adult mouse ovaries could be 
successfully isolated and purified using immunomagnetic sorting for the mouse vasa homolog (MVH). These FGSCs 
are located on the cortical surface of ovaries. Furthermore, an FGSC line was established from neonatal mice, with a 
normal karyotype and high telomerase activity. Adult FGSCs have also been cultured for more than 6 months. 
FGSCs in long-term culture maintain their capacity to produce normal oocytes and fertile offspring after trans- 
plantation into ovaries 10 . Several germ cell-specific markers have been screened to improve the efficiency of puri- 
fication. Using the germline- specific protein Fragilis, the efficiency of FGSC purification was remarkably enhanced 
compared with that using MVH 11 . We have also successfully generated transgenic and gene knockdown mice using 
FGSC 12 . Pacchiarotti et a\P and Hu et al. u also demonstrated the existence of a population of germline stem cells in 
postnatal mouse ovaries. White et al. successfully isolated and purified FGSCs from adult mice and reproductive-age 
women using fluorescence-activated cell sorting for Ddx4 (also called Vasa) 15 . Moreover, we have successfully 
generated transgenic rats using an FGSC line established from postnatal rats 16 . However, some recentiy published 
papers declared no mitotically active FGSCs exist in postnatal mouse ovaries 17,18 . It is necessary to describe the 
morphological and molecular characters of female germ line stem cells and compare similarity of FGSCs and SSCs. 

In the present study, we compared the morphological characteristics of FGSCs and SSCs and performed 
microarray analysis of the global gene expression profiles of both cell types. Although sex differentiation occurs 
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at 11.5 days post-coitum (dpc) 19 , there are still some similarities in 
the whole gene expression profiles between FGSCs and SSCs. The co- 
highly expressed genes (CEG) shared by FGSCs and SSCs may be 
responsible for those similarities. Functional annotation analysis of 
the CEG enriched GO terms related to cell cycle, cell division and 
proliferation and transcription regulation. Overlaps between the 
CEG and genes overexpressed in embryonic stem cell (ESCs), neural 
stem cells (NSCs) and hematopoietic stem cells (HSCs) validated the 
sternness of FGSC and SSC 20 . Motif screening identified several tran- 
scription factors related to cell cycle and stem cell maintenance. To 
gain further insights into the interactions of the CEG responsible for 
the similar properties of FGSCs and SSCs, we constructed a continu- 
ous protein-protein interaction network of the CEG based on I2D 
database by breadth first search. From the network we observed the 
interactions of the CEG responsible for maintaining the properties of 
germline stem cells. We obtain 35 hubs whose connectivity ranked 
within the top 10% most connected nodes in the maximum continu- 
ous network. According to the theory of scale-free networks, those 
hubs would play important roles in the continuous network that may 
be responsible for maintaining the similarities between FGSCs and 
SSCs. The functional analyses of the CEG and motif enrichment of 
stem special transcription factors led us to hypothesize that the CEG 
are responsible for the similar morphological and molecular char- 
acters shared by FGSCs and SSCs. The constructed continuous net- 
work of the CEG will provide some useful clues for studies focusing 
on the mechanisms of maintaining the unique properties of germline 
stem cells. 

Results 

Morphology and growth patterns of FGSCs are similar to those of 
SSCs. The morphology of FGSCs is very similar to that of SSCs. They 
are characterized by a round shape with a large bright spherical 
nucleus and an obvious boundary between the cytoplasm and 



nucleus (Fig. 1A). After culturing for 2 days, the isolated FGSCs 
appeared as grape-like clusters that are usually found in cultured 
SSCs (Fig. 1A). The nuclear boundary of the proliferated FGSCs 
was observed under light microscope, and was consistent with the 
result using Hoechst33342 staining (Supplementary Fig. SI online). 

The FGSCs possess germ cell properties similar to those of SSCs. 

To confirm that the FGSCs possess germline stem cell-specific 
properties, we performed the following experiments. First, we 
detected the expressions of germ cell-specific markers in the 
cultured FGSCs using reverse transcription polymerase chain 
reaction (RT-PCR). The results showed that FGSCs expressed 
MVH 11 , Dazl (a germ cell-specific RNA-binding protein) 22 , Oct4 (a 
germ cell-specific transcriptional factor) 23 and Fragilis (a germ cell- 
specific expression protein) 24 (Fig. IB). Immunofluorescence 
analysis showed that the FGSCs were Oct4 and MVH positive, as 
were SSCs (Fig. 1C). We also detected alkaline phosphatase staining 
of FGSCs, as has been previously described 10 (Fig. 1C). All the 
characteristics detected were consistent with previously identified 
female germline stem cells. Immunofluorescence analysis of BrdU 
incorporation and expression of MVH demonstrated that the FGSCs 
possessed proliferative ability (Fig. 2). These germ cell properties of 
FGSCs are similar to those of SSCs. 

The gene expression profiles in FGSCs and SSCs show similar 
signatures. We detected 19004 probes that were expressed in the 
gene expression profiles of FGSCs and SSCs. Initially we explored 
the microarray results of FGSCs and SSCs at the transcriptional level 
using unsupervised hierarchical clustering of the global gene 
expression profiles. A heat map for the whole gene expression 
profiles of FGSCs and SSCs showed obvious blocks representing 
similar gene expressing patterns for certain genes. As shown in 
Fig. 3A, the gene expression profiles of FGSCs and SSCs were 
similar. To identify key genes conserved in germline stem cells that 
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Figure 1 | Confirmation that FGSCs possess germline stem cell properties similar to SSCs. (A) Freshly isolated FGSCs and SSCs have a round shape with 
large, bright, spherical nuclei; there is an obvious boundary between the cytoplasm and the nucleus. Isolated FGSCs and SSCs form grape-like 
clusters after 2 days of culture. Scale bar =10 um. (B) RT-PCR analysis of germ cell markers. M, 100 bp DNA marker; lane l,freshlyisolatedFGSC;lane2, 
2-day cultured FGSC; lane 3, freshly isolated SSC; lane 4, 2-day cultured SSC; lane 5, no template control. (C) Immunofluorescence analysis of MVH, Oct4 
as well as alkaline phosphatase (AP) was used for characterization, and DPAI was used to stain the nucleus. Similar to SSCs, FGSCs are positive for 
MVH and Oct4. FGSCs are weakly stained by the AP assay, similar to SSCs. Scale bar =10 um. 
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Figure 2 | FGSCs possess proliferative abilities similar to SSC. Dual 
immunofluorescence for MVH and BrdU showed that FGSC show mitotic 
activity and maintaining germline properties, similar to SSC. Scale bar = 
10 um. 



might be responsible for the similar morphology of FGSCs and 
SSCs, we generated a list of CEG shared by both cell types based 
on the same normalized threshold (see methods). After removing 
housekeeping genes, there were 1273 and 1193 highly expressed 
genes in FGSCs and SSCs, respectively, of which 853 (52.88%) 
were CEG (Fig. 3B). The list of CEG is shown in Supplementary 
Table SI. The expression levels of several germ cell-specific genes 
in FGSCs and SSCs are shown in Fig. 3C. MVH, Dazl, Oct4 and 
Fragilis, which are commonly used to identify germ cells, were 
expressed in FGSCs and SSCs at levels close to the high expression 
threshold. Tdrkh, Akt3, Gml673, Hba-al, MovlOll and Fkbp6, 
which are specific to the maintenance of mouse primordial germ 
cells (PGC) during embryo development 25 , all showed expression 
levels above the high expression threshold in FGSCs and SSCs 
(Fig. 3C). 

Adult sternness genes shared by FGSCs and SSCs. In the whole gene 
expression profiles, we detected 216 adult sternness genes that were 




Figure 3 | Microarray analysis of gene expression profiles of FGSCs and SSCs. (A) Heatmap for the global gene expression profiles of FGSCs and SSCs. 
The heatmap for global gene expression profiles shows that FGSCs and SSCs share similarities at the transcriptional level. (B) A pine diagram showing the 
overlap of highly expressed genes of FGSCs and SSCs. The numbers of respective highly expressed genes was shown corresponding to FGSC and SSC. The 
CEG represented by blue portion refers to 853 genes. The proportion of respective parts is shown on the diagram. (C) Expression levels of germ cell special 
genes in FGSCs and SSCs detected by the microarray. Germ cell markers include MVH, Dazl, Oct4 and Fragilis. PGC key genes, such as Tdrkh, Akt3, 
Gml673, Hba-al, MovlOll and Fkbp6 were expressed at similarly high levels in FGSCs and SSCs. (D) Expression heatmap for 216 sternness genes 
detected by the microarray show similar expression patterns in FGSCs and SSCs. In the heatmap, each row represents the expression level of one gene in 
the gene expression profiles of FGSCs and SSCs. Heatmap visualization was performed by gplots package in R. Red represents highly expressed; green 
means low expression value. 
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Table 1 | Comparison of the CEG with other stem cells overexpressed genes profiles 

ESC (%) NSC (%) HSC (%) 



CEG(853) 130(15.24) 177(20.75) 146(17.12) 

18 core genes AcadmAW549877,Ccs£oprs,Dtymk,Fam3cfhllfkbp9,Gsta4,Kras,Nmel,Pvrl3,Rasal^ 



proposed as molecular signatures of stem cells by Melton et al. 20 . 
Unsupervised hierarchical clustering of the detected adult sternness 
genes showed that they had very similar expression patterns between 
FGSCs and SSCs (Fig. 3D). Hierarchical clustering analysis classified 
the sternness genes into several obvious blocks. The sternness genes 
in the red block may be important for the sternness of FGSCs and 
SSCs. The number of CEG that overlapped with the profiles of 
overexpressed genes in mouse embryonic, neural, and hemato- 
poietic stem cells are shown in Table 1. The presence of hundreds 
overlapping genes implies that FGSCs and SSCs share sternness 
properties with other types of stem cells. From Table 1, it is clear 
that the CEG shared a larger overlap with neural stem cells than with 
embryo stem cells or hematopoietic stem cells, suggested that 
reproductive system share some genes expression patterns with 
nervous system. 

According to the overlap between the CEG and the genes over- 
expressed in the other three stem cell types, we classified the CEG 
into two subsets: (i) CEG~ stemness (323 genes) representing the CEG 
that overlapped with the overexpressed genes of the other three 
types of stem cells; (ii) CEG~ eerm (530 genes) representing the CEG 
that did not overlap with the overexpressed genes of the other 
three types of stem cells. Further analysis of the CEG~ stemness iden- 
tified 18 genes that act as core genes shared by germline, embry- 
onic, neural and hematopoietic stem cells (Supplementary Fig. S2 
online). The 18 core genes shared by CEG, ESC, HSC and NSC 
were listed in Table 1. 

The presence of adult sternness genes in the CEG was also con- 
sistent with previous studies that proposed that spermatogonial stem 
cells share some, but not all, phenotypic and functional characters 
with other stem cells 3 . Some of the CEG were not overexpressed in 
the other stem cells, indicating that these genes may represent germ- 
line stem cell-specific genes. 

Transcription factors enriched by motif screening of cis-regulatory 
elements of the CEG. Motif screening of promoter cis-regulatory 
elements of the CEG would identify transcription factors that 
regulate the expression of certain CEG. We applied EXPANDER 26 
to identify transcription factors whose binding site signatures were 
significantly over-represented in the CEG. Several interesting tran- 
scription factors were identified. Cell cycle-related transcription 
factors c-Myc 27 and Egr-l 2s were identified, which are consistent 
with the rapid cell division properties of FGSCs and SSCs. Motif 
screening also identified several transcriptional regulators that play 
key roles in stem cell maintenance and cell proliferation. Tran- 
scription factors HSF enriched several important genes, such as 



Uchll 29 Texl4 30 MII2 31 , which play important roles in maintenance 
of spermatogenesis. The enriched transcription factors and their 
target genes were shown in Supplementary Table S2. TF-binding 
site analysis of the CEG provided potential regulators to study 
mechanism of FGSC and SSC self-renewal. The list of identified 
transcriptional regulators will be a useful resource for scientists 
engaged in germline stem cell research. 

Gene ontology analysis of the CEG. To assess the biological 
relevance of the CEG, we performed gene ontology analysis using 
the DAVID web server 32 . Functional analysis of the 853 CEG 
enriched GO terms mainly belonging to the three functional 
clusters (biological process, cellular component, and molecular 
function). The CEG in the cluster of biological processes were 
mainly involved in maintaining transcription, cell cycle, chromatin 
organization, metabolic process and protein localization (Supple- 
mentary Table S3 online). This is consistent with the rapid mitotic 
activity of FGSCs and SSCs. A graphical representation of biological 
process functional classification is shown in Supplementary Fig. S3. 
As FGSCs and SSCs display similar morphology and germline stem 
cell properties during in vitro culturing, we intended to identify genes 
with roles in mammalian germline stem cell maintenance; therefore, 
we generated lists of CEG related to cell proliferation, cell division 
and cell cycle (see Table 2). The list of cell proliferation related genes 
includes several genes that have been found to play important roles in 
maintaining the self-renewal of SSCs, such as Uchll 29 and MovlOll 33 . 
The adaptor protein Numb is expressed in male germ cells. Inac- 
tivation of Numb may influence spermatogonial differentiation 34 . 
Irs2 is required for testicular development, but the exact role of 
Irs2 in SSC remains unknown 35 . Stabl regulates the self-renewal of 
hematopoietic stem cells by promoting quiescence and repressing 
commitment to differentiation 36 . Srrt, also known as Ars2 has been 
found to play important roles in maintaining neural stem cell 
identity 37 . Functional analysis of those genes related to cell prolifera- 
tion, cell division and cell cycle found that 17 genes involved in 
transcription regulation (see Table 2). The 17 transcription related 
genes were possibly as transcription regulators regulating expression 
of cell proliferation, cell division and cell cycle related genes. The lists 
of cell proliferation, cell division and cell cycle related genes provides 
several suggested starting points for direct studies on the mecha- 
nisms of germline stem cell proliferation. 

Hub proteins of the maximum continuous network constructed 
from the CEG. We hypothesized that the CEG would be responsible 
for the morphological and molecular similarities between FGSCs and 



Table 2 | The list of cell proliferation related genes enriched by GO analysis 

Go Term Enriched genes 

Cell proliferation (20) Uchll ,Movl0ll ,Trp53,Bcl2l2,Numb,Uhrf2,Nras,lrs2,Appll ,Gpxl ,Uhrfl ,lm P dh2,Srrt,Naa35,Nasp,Kclmla,Gml3841, 
Cepl20,Satbl,Morf4ll 

Cell division (34) Ccntl ,Mad2ll ,Lzts2,Zc3hcl ,Haus8,M cm 5,Birc5,Zwint,Anln,Evi5,Pard6g,Cdk2,Nup43,Katnal ,Ncaph,Misl 8a,lncenp, 

Anapc4,Cdca3,Anxa 1 1 ,Mau2,Tipin,Rnf8,Chfr / Sept2,Ncapd3,Plkl ,Cdc20,Cdca8,Cdc25a,Prcl ,MapreT,Cdkl ,Nek3 

Cell cycle (75) Mad2ll,Fbxo31 ,Lzts2,Zc3hcl , Gak,Dmtfl ,Rbm38,Mdm2,Zwint,Anln,Evi5,M<:rn6,Cdkn lc,Ppm ld,Cdk2,Katnal ,Blcap, 

Misl8a,NaspAnapc4,PpmlgApp,Chh,Gm4799,Ncapd3Jdrdl,Cdc20,Cdtl,Foxn3,Cdca8,Prtf 
Rabgapl,Cepl20,Ccntl,Haus8,Birc5,Npml,Zfp318Atm,Gm9385fard6g,Piwil2,Nup43 

Chtfl8,Chaflb,Cdca3AnxallAurkaJrp53NauZUhrf2Jipin,Rnf8W^ 
Ehmt2,Rassfl ,Cdc25a,Clspn,E2f6,Cdkl ,Nek3,Sirt7 
Transcription (17) Morf4ll,DmtflNcm5Ncm6,Gm4799foxn3,Kdmla,Ccntl,Nsun4,Zfp318,Uhrfl,Chafl^ 
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SSCs. Therefore, we wanted to identify key genes of the CEG that play 
important roles in the maintenance of germline stem cell-specific 
properties. According to the theory of the complex network of 
protein-protein interactions, we constructed a maximum continu- 
ous network from the CEG based on I2d database. The network 
consisted of 352 nodes connected via 862 edges. The degree index 
y value of the constructed maximum continuous network was 
1.7412. According to the Barabasi- Albert model, the constructed 
maximum continuous network was a scale-free graph. The nodes 
whose connectivity ranked in the top 10% of all nodes in the net- 
work act as hubs. The hubs of the network would be responsible for 
the similarities between FGSC and SSC. The list of the 35 hubs is 
shown in Table 3. The hubs were highlighted with red color in the 
constructed maximum continuous network (Fig. 4). Several hubs 
were associated with cell proliferation. For example, Arrb2 is a 
regulatory protein involved in progastrin induction of colonic cell 
proliferation 38 , and Aurka is involved in determining mitotic spindle 
orientation 39 . Scaffold protein Shcl temporally regulates the EGF 
signaling network 40 . The actual roles of these hub proteins in main- 
taining the properties of germline stem cells will be investigated in 
future studies. 

The constructed network provided some clues for the regulatory 
mechanism of germline stem cell development. After mapping Cdhl, 
Sall4 and Uchll in the constructed network, we obtained a possible 
regulation network (Fig. 5). Cdhl, a surface marker of SSC also 
known as E-cadherin 41 , Uchll co-localizes with Plzf in mouse 
SSCs 29 , and Sall4 is an antagonist of Plzf 42 . The three germline- 
specific genes above bridged the Trp53, Mdm2, Hspdl, Smnl and 
App hubs in the constructed network. 



Comparision of gene expression profiles between neonatal and 
adult FGSCs. Based on results from the microarray analysis above, 
we checked 18 sternness genes expression in neonatal (3 days old) 
and adult (6 weeks old) FGSCs. Using the RT-PCR analysis, we found 
that all of the 18 sternness genes are expressed in both FGSCs. 
Neonatal and adult FGSCs also express germline markers, such as 
MVH, Dazl, Pragilis and Oct4. While, pluripotency-associated genes 
expressed in PGCs, such as Nanog, Sseal and Sox2, were neither 
expressed in neonatal FGSCs nor adult FGSCs (Supplementary Fig. 
S5 online). 

Discussion 

During the process of embryo development, sex differentiation 
occurs at 11.5 days postcoitum (dpc); at 12.5 dpc the phenotypes 
of the male and female gonads are distinguishable from each other. In 
this manuscript, we found that FGSCs are morphologically and 
molecularly similar to SSCs. Comparison of the whole gene express- 
ion profiles between FGSCs and SSCs further confirmed their sim- 
ilarity; the two cell types had similar signatures of globe transcription 
and the expression patterns of the sternness genes were similar. 

Comparison of FGSCs and SSCs allowed us to identify conserved 
molecules related to the unique properties of germline stem cells. 
Eighteen genes were identified as core genes shared by germline stem 
cells, mouse embryonic, neural, and hematopoietic stem cells 
(Table 1). Several of these sternness genes have been found to play 
key roles in stem cell self-renewal. For example, Zfx (Zinc finger 
protein X-linked) controls the self-renewal and maintenance of 
embryonic stem cells 43,44 . The 18 sternness genes may play critical 
roles in the maintenance of stem cell properties. The detailed roles of 



Table 3 I Hubs of the maximum continuous network constructed from the CEGs 



Entrez Gene Id 


Symbol 


Definition 


Degree 


22059 


Tr P 53 


Transformation related protein 53 


44 


13627 


Eeflal 


Eukaryotic translation elongation factor 1 alpha 1 


40 


22628 


Ywhag 


Tyrosine3-monooxygenase activation protein 


32 


17246 


Mdm2 


Transformed mouse 3T3 cell double minute 2 


30 


18607 


Pdpkl 


3-phosphoinositide dependent protein kinase 1 


29 


227700 


Sh3glb2 


SH3-domain GRB2-like endophilin B2 


27 


216869 


Arrb2 


Arrestin, beta 2 


27 


26416 


Mapkl4 


Mitogen-activated protein kinase 14 


24 


20466 


Sin3a 


Transcriptional regulator SIN3A 


24 


18148 


Npml 


Nucleophosmin 1 


23 


15510 


Hspdl 


Heat shock protein 1 


21 


18195 


Nsf 


N-ethylmaleimide sensitive fusion protein 


21 


15078 


H3f3a 


H3 histone, family 3A 


21 


67160 


Eeflg 


Eukaryotic translation elongation factor 1 gamma 


20 


1 1966 


Atp6vlb2 


ATPase, H+ transporting, lysosomal VI subunit B2 


19 


17127 


Smad3 


SMAD family member 3 


18 


12534 


Cdkl 


Cyclin-dependent kinase 1 


18 


68275 


Rpal 


Replication protein Al 


17 


20595 


Smn 1 


Survival motor neuron 1 


17 


20416 


Shcl 


Src homology 2 domain-containing transforming protein CI 


16 


17217 


Mcm4 


Minichromosome maintenance deficient 4 homolog 


16 


227800 


Rabgap 1 


RAB GTPase activating protein 1 


15 


13589 


Mapre 1 


Microtubule-associated protein, RP/EB family, member 1 


15 


17219 


Mcm6 


Minichromosome maintenance deficient 6 


15 


58231 


Stk4 


Serine/threonine kinase 4 


14 


20878 


Aurka 


Aurora kinase A 


14 


12566 


Cdk2 


Cyclin-dependent kinase 2 


13 


18817 


Plkl 


polo-like kinase 1 


13 


1 1820 


App 


Amyloid beta precursor protein 


13 


319152 


Histlh3h 


Histone cluster 1 , H3h 


13 


21812 


Tgfbrl 


Transforming growth factor, beta receptor 1 


13 


17218 


Mcm5 


minichromosome maintenance deficient 5 


13 


76788 


KlhdclO 


Kelch domain containing 1 0 


13 


13209 


Ddx6 


DEAD (Asp-Glu-Ala-Asp) box polypeptide 6 


12 


16202 


Ilk 


Integrin linked kinase 


12 
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Figure 4 | An overview of the constructed maximum continuous network including 352 unique proteins of the CEG. The maximum continuous 
network includes 352 nodes and 862 edges. The size of nodes was corresponded to their degrees. The 35 hubs in the constructed network were highlighted 
with red colour. Core sternness genes (8 genes) shared by FGSCs and SSCs were labelled with yellow colour. Cell proliferation related genes (13 genes) 
locate in the network were labelled with blue colour. The remaining nodes in the network were painted green to discriminate from others. 



these genes in germline stem cell self-renewal will be investigated in 
future studies. 

Motif screening of the promoters of the CEG provided insights 
into the regulatory network of germline stem cell-specific genes. 
Transcription factors HSF enriched several important genes, such 
as Uchll 2 '' TexM™ MII2 3i , which have been found playing important 
roles in spermatogenesis. The functions of other HSF target genes in 
maintenance and self-renewal of germline stem cells will be revealed 
by future studies. Cell cycle-related transcription factor, Egr-1 
enriched 151 genes in the CEG~ Berm set. Several target genes of Egr- 
1 play important roles in stem cell self-renewal and maintenance. For 
example, Dyrklb promotes proliferation of immature male germ 
cells in mice 45 . Nme7 regulates several critical regulators in ESC 
self-renewal 46 . Thapll is essential for embryogenesis and the plur- 
ipotency of mouse embryonic stem cells 47 . Satbl regulates the self- 
renewal of hematopoietic stem cells 36 . 

As the limitation of confirmed interactions between proteins, the 
I2D database is incomplete. The protein-protein interaction database 
does not comprise all of the interactions between mouse genes. 



Although it is impossible to construct a network covering all of the 
CEG, the constructed network is also a useful resource for researches 
in germline stem cells. Although the incomplete information of pro- 
tein interactions, the constructed network still provide some clues for 
germline stem cell research. Most of cell proliferation related genes 
were located in the constructed continuous network (Supplementary 
Fig. S4A online). Extracted those cell proliferation related genes and 
their first neighbors from the network would form a continuous 
subnetwork. The continuous subnetwork consisted of 13 cell prolif- 
eration related genes, 16 hubs and another 26 nodes suggests a pos- 
sible regulation network responsible for germline stem cell 
proliferation (Supplementary Fig. S4A online). From the subnet- 
work, we can see that Appll interacted with Akt3 aparting from other 
cell proliferation related genes which means Appll may be play roles 
in some other different pathways. Appll acts as a critical regulator of 
the crosstalk between adiponectin signaling and insulin signaling 
pathways 48 . About the 18 core sternness genes shared by germline 
stem cells, embryonic stem cells, hematopoietic stem cells and neural 
stem cells, there are only 8 genes locate at the constructed network. 
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Figure 5 | A sub-network generated by key SSC genes Cdhl, Sall4 and Uchll. The three germline specific genes Cdhl, Sall4 and Uchll are bridged 
together by hub Mdm2 in the constructed network. There are another five hubs in the sub-network. They are Trp53, Mdm2, Hspdl, Smnl and App. 



The 8 core sternness genes connected with each other through 9 hubs 
(Supplementary Fig. S4B online). Those scattered nodes connected 
together through hubs confirmed the importance of hubs in main- 
taining the stability of network. To access the function of single hub 
will provide new sights for germline stem cell research. For example, 
Mdm2 may be bridge SSC special genes Cdhl, Uchll and SalU to 
maintaining germline stem cell properties. Functional analysis of the 
hubs indicated that most of the hubs were related to cell cycle, mitosis 
or acetylation, which is consistent with the mitotic activity of FGSCs 
and SSCs. 

In our previous studies, FGSCs from adult mouse ovaries display 
similar properties of FGSCs from neonatal ovaries 10 . In this study, we 
compared the gene expression profiles between neonatal and adult 
FGSCs by RT-PCR analysis. The result showed that the neonatal 
FGSCs have same gene expression profile with adult FGSCs, while, 
the neonatal FGSCs do not express the marker genes which highly 
expressed in PGCs. This means that the FGSCs are not the same as 
PGCs, which also consistent with our previous studies 10 . 

In summary, our studies compared the morphological and 
molecular characteristics of FGSCs and SSCs. Microarray analysis 
of the globe gene expression profiles of FGSCs and SSCs showed 
similar signatures at the transcriptional level, which may be respons- 
ible for the similar morphologies of the two germline stem cell popu- 
lations. Motif screening of the promoters of CEG and the constructed 
continuous network provided insights into the regulatory network of 
germline stem cell-specific genes. This CEG is a useful resource for 
scientists engaged in germline stem cell research. 

Methods 

Animals. CD-I mice were used in this study. Three -day- old or 6- week-old female 
and 6-day-old male mice were used to isolate germline stem cells. The Institutional 
Animal Care and Use Committee of Shanghai approved all the procedures used in 
this study, which were performed in accordance with the National Research Council 
Guide for Care and Use of Laboratory Animals. 

Isolation, purification and culture of germline stem cells. The FGSCs from 16 
ovaries of 3-day-old mice or from 4 ovaries of 6-week-old (adult) mice were isolated, 
purified and cultured using a previously described method 10 ' 12,49-51 . The SSCs from 12 
testes of 6-day-old mice were isolated, purified and cultured using the method of 
Yuan et al 52 . After culturing the germline stem cells for 2 days, alkaline phosphatase 
activity assay, BrdU labeling, immunofluorescence, RNA isolation and microarray 
hybridization were performed (see below). For microarray hybridization, we cultured 
the FGSC and SSC in a feeder-free culture system. The culture medium for germline 
stem cells comprised minimum essential a medium (Gibco, Grand Island, NY, USA). 



10% FBS (Front Biomedicals USA, Eatontown, New Jersey, USA), sodium pyruvate 
(1 mM, Sigma, St. Louis, MO, USA), non-essential amino acids (1 mM, Gibco), 
L-glutamine (2 mM, Sigma), [3-mercaptoethanol (0.1 mM, Sigma), LIF (10 ng/ml, 
Santa Cruz, Dallas, TX, USA), mouse epidermal growth factor (10 ng/ml, Peprotech, 
Rock Hill, NJ, USA), glial cell line -derived neurotrophic factor (10 ng/ml, Peprotech), 
basic fibroblast growth factor (10 ng/ml, Peprotech) and penicillin (15 ug/ml, 
Sigma). After 2 days of culture, primary oocytes were easily discernible from 
proliferated FGSCs; the oocytes could be picked out using micropipette under a 
stereomicroscope. The remaining cells comprised the FGSCs. 

Alkaline phosphatase activity assay. The cultured germline stem cells were fixed 
with 4% paraformaldehyde for no more than 2 min at room temperature. The 
alkaline phosphatase activity of the germline stem cells was measured using an 
alkaline phosphatase activity detection kit, according to the manufacturer's 
instructions (Millipore, Billerica, MA, USA). 

Hoechst 33342 staining. After incubated with Hoechst 33342 (1 ug/ml, Sigma) at 
room temperature for 20 min, the FGSCs were rinsed twice in PBS and then ready for 
analysis. 

BrdU labeling. BrdU (50 ug/ml, Sigma) was added to germline stem cell cultures for 
5 h. The germline stem cells were then ready for immunofluorescence of BrdU 
staining. 

Immunofluorescence. Cultured FGSCs and SSCs were washed twice with PBS 
(phosphate-buffered saline) and then fixed for 15 min with 4% paraformaldehyde at 
room temperature. After fixation, the cells were rinsed twice in PBS and then 
incubated with PBS containing 0.1% (v/v) Triton X-100 for 15 min, at room 
temperature. After rinsing, the fixed cells were blocked with 10% normal goat serum 
in PBS for 10 min at room temperature. The cells were then incubated at 4°C 
overnight with primary antibodies: rabbit polyclonal anti-MVH (1 : 180 dilution; 
Abeam, Cambridge, MA, USA), rabbit polyclonal anti-Oct4 (1 : 150 dilution; 
Chemicon, Temecula, CA, USA), and mouse monoclonal anti-BrdU (1 : 150 dilution; 
Lab Vision Corporation, Fremont, CA, USA). After incubation with primary 
antibodies, the cells were rinsed twice in PBS, and then incubated in darkness with a 
fluorescein isothiocyanate-conjugated secondary antibody either goat anti-rabbit IgG 
or goat anti-mouse IgG (1 : 180 dilution; Proteintech, Chicago, USA) for 60 min at 
37°C, and then rinsed and the nuclei were stained by 4', 6-diamidino-2-phenylindole 
(DAPI) (1 : 1000 dilution; Sigma)-containing PBS for 10 min at room temperature. 
After washing twice with PBS, the cells were examined in fresh PBS under an inverted 
fluorescence microscope (Olympus, Tokyo, Japan). 

RT-PCR. Total RNA from germline stem cells was extracted using the Trizol reagent 
(Qiagen Ltd, Crawley, UK), according to the manufacturer's instructions. After 
extraction, 1 ug of total RNA was used to synthesize cDNA with Primescript Reverse 
Transcriptase (TaKaRa, Dalian, China). Primers sequences were listed in 
Supplmentary Table S5. 

RNA isolation and microarray hybridization. RNA from germline stem cells 
cultured for 2 days (see above) was pooled in equal concentrations. Three respective 
replicates of FGSC and SSC RNA that passed quality check were hybridized to the 
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Mouse WG-6 v2.0 microarray, according to the manufacturer's instructions 
(Illumina, San Diego, CA). The data discussed in this publication have been deposited 
in NCBI's Gene Expression Omnibus and are accessible through the GEO Series 
accession number GSE51313. 

Data analysis and gene ontology analysis. Data analysis was conducted in the R 
software environment for statistical computing and graphics. The Lumi package was 
used to process the raw data exported from GenomeStudio (Illumina) 53 . We detected 
19004 probes expressed based on the cut-off p-value (p < 0.01), representing the 
whole gene expression profiles of FGSCs and SSCs. To compare the global similarity 
of the whole gene expression profiles between FGSCs and SSCs, gplots package in R 
was used for unsupervised hierarchical clustering. We set the normalized gene 
expression value at 8.0 as a threshold to obtain highly expressed genes. Housekeeping 
genes were removed from the highly expressed gene list. These comprised well- 
known housekeeping genes and homologs of identified human housekeeping genes 54 . 
The housekeeping gene list is shown in Supplementary Table S4. The profiles of 
overexpressed genes in mouse embryonic stem cells, neural stem cells, and 
hematopoietic stem cells used to compared with CEG were referenced to Miguel's 
work 20 . The CEG were uploaded to the database for annotation, visualization and 
integrated discovery (DAVID) for gene ontology (GO) enrichment 32 . Enriched 
functional categories (p < 0.01) were identified in the GO analyses of CEG. 

Computational analysis of promoter cis-regulatory elements of the CEG. For 

promoter analysis we applied EXPANDER 26 to detect cis-regulatory promoter 
elements that control the expressions of the CEG shared by FGSCs and SSCs. The 
whole set of expressed genes was used as a background set of promoters. Bonferroni 
correction in EXPANDER was used to identify significant transcription factors (TFs). 
The promoter sequences were used for screening spanned 1000 bp upstream and 
200 bp downstream of the transcriptional start sites. 

Construction of the maximum continuous protein-protein interaction network of 
the CEG by breadth first search. According to the theory of complex networks of 
protein -protein interactions, we constructed the interaction network of the CEG 
based on I2D database. I2D version 2.0 includes 150,230 protein-protein interactions 
for mouse 55,56 . After converting the Entrez IDs of the CEG into the corresponding 
Swiss-Prot Id, we mapped the CEG to the 150,230 protein -protein interactions. We 
then compiled a PERL script to construct maximum continuous protein-protein 
interaction network of the mapped the CEG by breadth first search. We defined the 
hub proteins as those whose connectivity was in the top 10% of the connected nodes 
in the maximum continuous network. Cytoscape 3.0.2 was used to visualize and 
analysis the maximum continuous protein -protein interaction network 57 . 
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